AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Visual Instruction Understanding

# Visual Instruction Understanding

Uground V1 72B
Other
UGround is a powerful GUI visual localization model trained with a simple recipe, focusing on image-text-to-text multimodal tasks.
Image-to-Text Transformers English
U
osunlp
129
4
Co Instruct
Co-Instruct is a vision-language model focused on image-to-text generation tasks, capable of analyzing image content and generating relevant textual descriptions or answering questions about images.
Image-to-Text Transformers
C
q-future
470
19
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase